Abstract. We consider some questions related to the signs of Hecke eigenvalues or Fourier coefficients of classical modular forms. One problem is to determine to what extent those signs, for suitable sets of primes, determine uniquely the modular form, and we give both individual and statistical results. The second problem, which has been considered by a number of authors, is to determine the size, in terms of the conductor and weight, of the first sign-change of Hecke eigenvalues. Here we improve the recent estimate of Iwaniec, Kohnen and Sengupta.



1. Introduction

There are many results in the arithmetic of modular forms which are concerned with various ways of distinguishing a given primitive cusp form $f$ from its siblings, starting from the fact that the Fourier coefficients, hence the $L$-function, determine uniquely a cusp form $f$ relative to a congruence subgroup $\Gamma$ of $SL(2, \mathbf{Z})$. Among such results are stronger forms of the multiplicity one theorem for automorphic forms or representations, various explicit forms of these statements, where only finitely many coefficients are required (say at primes $p \leq X$, for some explicit $X$ depending on the parameters defining $f$), and a number of interesting "statistic" versions of the last problem, where $X$ can be reduced drastically, provided one accepts some possible exceptions. Among other papers, we can cite [4], [16], [3] or [9].

Some of these statements were strongly suggested by the analogy with the problem 
of the least quadratic non-residue, which is a problem of great historic importance 
in analytic number theory, and there are many parallels between the results which 
have been obtained. However, this parallel breaks down sometimes. For instance, 
in [13], Lau and Wu note that one result of [12] for the least quadratic non-residue is highly unlikely to have a good analogue for modular forms. This result (see [12, Th. 3]) is a precise estimate for the number of primitive real Dirichlet characters $\chi$ of modulus $q \leq D$ for which the least $n$ with $\chi(n) = -1$ is $\gg \log D$, and the difficulty is that this estimate can be understood by assuming that the values $\chi(p)$, for $p$ of moderate size compared with $D$, behave like independent random variables taking values $\pm 1$ equally often. However, Hecke eigenvalues may take many more than two values, and thus assuming that they coincide should definitely be a much more stringent condition.

In this paper, we consider a way to potentially recover a closer analogy: namely (narrowing our attention to forms with real eigenvalues) instead of looking at the values of the Hecke eigenvalues, we consider only their signs (where we view 0 as being of both signs simultaneously, to increase the possibility of having same sign). Then classical questions for Dirichlet characters and modular forms have the following analogues for signs of Hecke eigenvalues $\lambda_f(p)$ of a classical modular form $f$:

• What is the first sign-change, i.e., the smallest $n \geq 1$ (or prime $p$) for which $\lambda_f(n) < 0$ (or $\lambda_f(p) < 0$)? (Analogues of the least quadratic non-residue.) Note a small difference with quadratic characters: it is not true here that the smallest integer with negative Hecke eigenvalue is necessarily prime; finding one or the other are two different questions.$^{1}$

• Given arbitrary signs $\varepsilon_p \in \{\pm 1\}$ for all primes, what is the number of $f$ (in a suitable family) for which $\lambda_f(p)$ has sign $\varepsilon_p$ for all $p \leq X$, for various values of $X$? (Analogue of the question in [12].)

• In particular, is there a finite limit $X$ such that coincidences of signs of $\lambda_f(p)$ and $\varepsilon_p$ for all $p \leq X$ implies that $f$ is uniquely determined? (Analogue of the multiplicity one theorem.)
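
The distinction in the first question (smallest integer versus smallest prime) can be checked by hand on the example quoted in footnote 1, the weight 2 form attached to $y^2 = x^3 + x$. The following is only an illustrative sketch: it assumes the standard point-counting formula $a(p) = -\sum_{x \bmod p} \left(\frac{x^3+x}{p}\right)$ for odd primes $p$ and the weight 2 relation $a(p^2) = a(p)^2 - p$ at good primes; the helper names `legendre` and `a_p` are ours, not the paper's, and the coefficients are the unnormalized ones (which have the same signs as the $\lambda_f(n)$).

```python
def legendre(a, p):
    """Legendre symbol (a/p) for an odd prime p, via Euler's criterion."""
    a %= p
    if a == 0:
        return 0
    return 1 if pow(a, (p - 1) // 2, p) == 1 else -1


def a_p(p):
    """Unnormalized coefficient a(p) = -sum_x ((x^3 + x)/p) for an odd prime p."""
    return -sum(legendre(x**3 + x, p) for x in range(p))


# Odd primes up to 13 (the prime 2 is the only prime of bad reduction).
coeffs = {p: a_p(p) for p in (3, 5, 7, 11, 13)}

# Weight 2 Hecke relation at the good prime 3: a(9) = a(3)^2 - 3.
a_9 = coeffs[3] ** 2 - 3

print(coeffs, a_9)
```

With these values, no prime $p < 13$ has a negative coefficient, while the composite index 9 already does.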

Of these three problems, only the first one seems to have been considered earlier,
with the best current result due to Iwaniec, Kohnen and Sengupta [6]. We will 
improve it, and obtain some first results concerning the other two problems. We 
will also suggest further questions that may be of interest. 

Before stating our main theorems, here is the basic notation about modular forms (see, e.g., [7, Ch. 14] for a survey of these facts). We denote by $H_k^*(N)$ the finite set of all primitive forms of weight $k$ for $\Gamma_0(N)$, where $k \geq 2$ is an even integer and $N \geq 1$ is an integer. The restriction to trivial Nebentypus ensures that all Fourier coefficients are real, and for any $f \in H_k^*(N)$, we denote

$$f(z) = \sum_{n=1}^{\infty} \lambda_f(n)\, n^{(k-1)/2}\, e(nz), \qquad e(z) = e^{2\pi i z}, \qquad (\Im m\, z > 0),$$

its Fourier expansion at infinity. Since $f$ is primitive, the $\lambda_f(n)$ are the normalized eigenvalues of the Hecke operators $T_n$, and satisfy the well-known Hecke relations

(1.1) $$\lambda_f(m)\lambda_f(n) = \sum_{\substack{d \mid (m,n) \\ (d,N)=1}} \lambda_f\Big(\frac{mn}{d^2}\Big)$$

for all integers $m \geq 1$ and $n \geq 1$. In particular, $\lambda_f$ is a multiplicative function of $n$ (so $\lambda_f(1) = 1$) and moreover the following important special case

(1.2) $$\lambda_f(p)^2 = 1 + \lambda_f(p^2)$$

holds for all primes $p \nmid N$.
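
As a sanity check of (1.1) and (1.2), one can test them on a concrete primitive form. The sketch below uses the Ramanujan $\Delta$ function (weight 12, level 1), which is our choice of example and is not discussed in the text. Its unnormalized coefficients $\tau(n) = \lambda(n)\, n^{11/2}$ are read off from $\Delta = q\prod_{n \geq 1}(1-q^n)^{24}$, and in this normalization (1.1) becomes $\tau(m)\tau(n) = \sum_{d \mid (m,n)} d^{11}\tau(mn/d^2)$. The names `tau` and `hecke_rhs` are hypothetical helpers.

```python
from math import gcd

M = 40  # number of coefficients of the infinite product to compute

# Coefficients of prod_{n>=1} (1 - q^n)^24 up to q^(M-1); tau(n) is the
# coefficient of q^(n-1), because Delta = q * prod (1 - q^n)^24.
c = [0] * M
c[0] = 1
for n in range(1, M):
    for _ in range(24):
        # Multiply in place by (1 - q^n); going downwards keeps c[i - n] "old".
        for i in range(M - 1, n - 1, -1):
            c[i] -= c[i - n]


def tau(n):
    return c[n - 1]


def hecke_rhs(m, n):
    """Right-hand side of (1.1) for Delta, in the unnormalized form."""
    g = gcd(m, n)
    return sum(d**11 * tau(m * n // d**2) for d in range(1, g + 1) if g % d == 0)


# Check the Hecke relations for all 1 <= m, n <= 6 (so mn <= 36 < M).
hecke_ok = all(tau(m) * tau(n) == hecke_rhs(m, n)
               for m in range(1, 7) for n in range(1, 7))

print(tau(2), tau(3), tau(4), hecke_ok)
```

The case $m = n = 2$ is the unnormalized form of (1.2): $\tau(2)^2 = \tau(4) + 2^{11}$. Since $\tau(2) < 0$, the first negative eigenvalue of $\Delta$ already occurs at $n = 2$.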

Furthermore, it is also known that $\lambda_f(n)$ satisfies the deep inequality

(1.3) $$|\lambda_f(n)| \leq \tau(n)$$

for all $n \geq 1$, where $\tau(n)$ is the divisor function (this is the Ramanujan-Petersson conjecture, proved by Deligne). In particular, we have $\lambda_f(p) \in [-2, 2]$ for $p \nmid N$, and hence there exists a unique angle $\theta_f(p) \in [0, \pi]$ such that

(1.4) $$\lambda_f(p) = 2\cos\theta_f(p).$$

$^{1}$ E.g., as a random example, for the cusp form of weight 2 associated to the elliptic curve $y^2 = x^3 + x$, the first negative coefficient is $\lambda(9) = -3$, and the first negative prime coefficient is $\lambda(13) = -6$.

Our other notation is standard in analytic number theory: for instance, $\pi(x)$ denotes the number of primes $\leq x$ and $P^+(n)$ (resp. $P^-(n)$) denotes the largest (resp. smallest) prime factor of $n$, with the convention $P^+(1) = 1$ (resp. $P^-(1) = \infty$).

We now describe our results. 

1.1. The first negative Hecke eigenvalue. For $f \in H_k^*(N)$, $k \geq 2$ and $N \geq 1$, it is well-known that the coefficients $\lambda_f(n)$ change sign infinitely often. We denote by $n_f$ the smallest integer $n \geq 1$ such that $(n, N) = 1$ and

(1.5) $$\lambda_f(n) < 0.$$

The analogue (or one analogue) of the least quadratic non-residue problem is to estimate $n_f$ in terms of the analytic conductor $Q := k^2N$. Iwaniec, Kohnen and Sengupta [6] have shown recently that

$$n_f \ll Q^{29/60} = (k^2N)^{29/60}$$

(here, standard methods lead to $n_f \ll_\varepsilon Q^{1/2+\varepsilon}$, so the significance is that the exponent is $< 1/2$).

Our first result is a sharpening of this estimate: 

Theorem 1. Let $k \geq 2$ be an even integer and $N \geq 1$. Then for all $f \in H_k^*(N)$, we have

(1.6) $$n_f \ll Q^{9/20} = (k^2N)^{9/20},$$

where the implied constant is absolute.

This bound is not the best that can be achieved using our method, and we will comment on this after its proof (in particular, an interesting function $\beta$ occurs when trying to push the idea to its limit).

One automatic improvement of the exponent arises from any subconvexity bound 
for the relevant L-functions, as already observed by Iwaniec, Kohnen and Sengupta. 
We do not need such deep results to prove Theorem 1, but we will state below the 
precise relation. 

We do not know if the estimate of Theorem 1 holds for the first negative Hecke 
eigenvalue at a prime argument. 

1.2. Statistic study of the first sign-change. The upper bound (1.6) is probably 
far from optimal. Indeed, one can show that under the Grand Riemann Hypothesis 
we have 

$$n_f \ll (\log(kN))^2$$

where the implied constant is absolute. Our next result confirms this unconditionally for almost all $f$. It closely parallels the case of Dirichlet characters (see [12]). Precisely, we first recall that

$$|H_k^*(N)| \asymp k\varphi(N),$$
where $\varphi(N)$ is the Euler function, as $k, N \to +\infty$, and we prove:

Theorem 2. Let $\nu \geq 1$ be a fixed integer and $\mathscr{P}$ be a set of prime numbers of positive density in the following sense:

$$\sum_{\substack{z < p \leq 2z \\ p \in \mathscr{P}}} \frac{1}{p} \geq \frac{\delta}{\log z} \qquad (z \geq z_0)$$

for some constants $\delta > 0$ and $z_0 > 0$. Let $\{\varepsilon_p\}_{p \in \mathscr{P}}$ be a sequence of real numbers such that $|\varepsilon_p| = 1$ for all $p$. Let $k \geq 2$ be an even integer and $N \geq 1$ be squarefree. Then there are two positive constants $C$ and $c$ such that the number of primitive cusp forms $f \in H_k^*(N)$ satisfying

$$\varepsilon_p \lambda_f(p^\nu) > 0 \quad \text{for } p \in \mathscr{P},\ p \nmid N \text{ and } C\log(kN) < p \leq 2C\log(kN)$$

is bounded by

$$\ll kN \exp\Big(-c\,\frac{\log kN}{\log\log kN}\Big).$$

Here $C$, $c$ and the implied constant depend on $\nu$ and $\mathscr{P}$ only.

Taking $\mathscr{P}$ the set of all primes, $\varepsilon_p = 1$ and $\nu = 1$ in Theorem 2, we immediately get:$^{2}$

Corollary 1. Let $k \geq 2$ be an even integer and $N \geq 1$ be squarefree. There is an absolute positive constant $c$ such that we have

$$n_f \ll \log(kN),$$

for all $f \in H_k^*(N)$, except for $f$ in an exceptional set with

$$\ll kN \exp\Big(-c\,\frac{\log kN}{\log\log kN}\Big)$$

elements, where the implied constants are absolute.

It is very natural to ask whether this result is optimal (as the analogue is known 
to be for real Dirichlet characters). In this direction, we can prove the following: 

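Theorem 3. Let $N$ be a squarefree number and $k \geq 2$ an even integer, and let $(\varepsilon_p)$ be a sequence of signs indexed by prime numbers. For any $\varepsilon > 0$, $\varepsilon < 1/2$, there exists $c > 0$ such that

$$\frac{1}{|H_k^*(N)|}\,\big|\{f \in H_k^*(N) \mid \lambda_f(p) \text{ has sign } \varepsilon_p \text{ for } p \leq z,\ p \nmid N\}\big| \geq \Big(\frac{1}{2} - \varepsilon\Big)^{\pi(z)}$$

for $z = c\sqrt{(\log kN)(\log\log kN)}$, provided $kN$ is large enough.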

One may expect that the same result would be true for $z \leq c\log kN$ (note that

$$\Big(\frac{1}{2}\Big)^{\pi(c\log kN)} \geq \exp\Big(-c_1\,\frac{\log kN}{\log\log kN}\Big)$$

so this result would be quite close to the statistic upper-bound of Corollary 1, and would essentially be best possible, confirming that the signs of $\lambda_f(p)$ behave almost like independent (and unbiased) random variables in that range of $p$).

$^{2}$ The cases where $\nu \geq 2$ can be interpreted as similar statements for the $\nu$-th symmetric powers.

Theorems 2 and 3 will be proved in Section 3, using the method in [13] and quantitative equidistribution statements for Hecke eigenvalues, respectively.

1.3. Recognition of modular forms by signs of Hecke eigenvalues. Here we consider whether it is true that a primitive form $f$ is determined uniquely by the sequence of signs of its Fourier coefficients $\lambda_f(p)$, where we recall that we interpret the sign of 0 in a relaxed way, so that 0 has the same sign as both positive and negative numbers.

The answer to this question is, indeed, yes, and in fact (in the non-CM case) an analogue of the strong multiplicity one theorem holds: not only can we exclude finitely many primes, or a set of primes of density zero, but even a set of sufficiently small positive density. Here, the density we use is the analytic density defined as follows: a set $E$ of primes has density $\kappa > 0$ if and only if

(1.7) $$\sum_{p \in E} \frac{1}{p^{\sigma}} \sim \kappa \log\frac{1}{\sigma - 1} \qquad (\sigma \to 1^+).$$

We will prove:

Theorem 4. Let $k_1, k_2 \geq 2$ be even integers, let $N_1, N_2 \geq 1$ be integers and $f_1 \in H_{k_1}^*(N_1)$, $f_2 \in H_{k_2}^*(N_2)$.

(1) If the signs of $\lambda_{f_1}(p)$ and $\lambda_{f_2}(p)$ are the same for all $p$ except those in a set of analytic density 0, then $f_1 = f_2$.

(2) Assume that neither of $f_1$ and $f_2$ is of CM type, for instance assume that $N_1$ and $N_2$ are squarefree. Then, if $\lambda_{f_1}(p)$ and $\lambda_{f_2}(p)$ have same sign for every prime $p$, except those in a set $E$ of analytic density $\kappa$, with $\kappa \leq 1/32$, it follows that $f_1 = f_2$.

Recall that a form $f \in H_k^*(N)$ is of CM type if there exists a non-trivial primitive real Dirichlet character $\chi$ such that $\lambda_f(p) = \chi(p)\lambda_f(p)$ for all but finitely many primes $p$. In that case, $\lambda_f(p) = 0$ for all $p$ such that $\chi(p) = -1$, and hence its signs coincide (in our relaxed sense) with those of any other modular form for a set of primes of density at least 1/2.

Of course, Theorem 4 is also valid for the natural density, since the existence 
of the latter implies that of the analytic density, and that they are equal. As a 
corollary, we get of course: 

Corollary 2. For any sequence of signs $(\varepsilon_p)$ indexed by primes, there is at most one pair $(k, N)$ and one $f \in H_k^*(N)$ such that $\lambda_f(p)$ has sign $\varepsilon_p$ for all primes.

Theorem 4 is proved in Section 4. The argument is short and simple, but it depends crucially on a very deep result: Ramakrishnan's proof [18] that the Rankin-Selberg convolution $L$-function is the $L$-function of some modular form on $GL(4)$.

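1.4. Motivation, further remarks and problems. The main remark is that, underlying most of the problems we consider is the Sato-Tate conjecture, which we recall (see Mazur's survey [14]): provided $f$ is not of CM type (for instance, if $N$ is squarefree), one should have

$$\lim_{x \to +\infty} \frac{|\{p \leq x \mid p \nmid N,\ \alpha \leq \theta_f(p) \leq \beta\}|}{\pi(x)} = \mu_{ST}([\alpha, \beta])$$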




E. KOWALSKI, Y.-K. LAU, K. SOUNDARARAJAN & J. WU 



for any $\alpha < \beta$, where $\mu_{ST}$ is the Sato-Tate measure

$$\mu_{ST} = \frac{2}{\pi}\sin^2\theta\, d\theta,$$

on $[0, \pi]$. Since $\mu_{ST}([0, \pi/2]) = \mu_{ST}([\pi/2, \pi])$, this indicates in particular that the signs of $\lambda_f(p)$ should be equitably shared between $+1$ and $-1$. This suggests and motivates many of our results and techniques of proof.
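
The symmetry of the Sato-Tate measure about $\pi/2$ is easy to confirm numerically. The sketch below is only an illustration: it approximates $\mu_{ST}([a,b]) = \frac{2}{\pi}\int_a^b \sin^2\theta\, d\theta$ by the midpoint rule (the function name `sato_tate_mass` and the step count are our choices), and also evaluates the probability that two independent Sato-Tate samples have the same sign.

```python
import math


def sato_tate_mass(a, b, steps=100_000):
    """Midpoint-rule approximation of mu_ST([a, b]) = (2/pi) * int_a^b sin^2(t) dt."""
    h = (b - a) / steps
    total = sum(math.sin(a + (i + 0.5) * h) ** 2 for i in range(steps))
    return 2.0 / math.pi * total * h


positive = sato_tate_mass(0.0, math.pi / 2)      # angles with 2 cos(theta) >= 0
negative = sato_tate_mass(math.pi / 2, math.pi)  # angles with 2 cos(theta) <= 0

# Probability that two independent Sato-Tate samples share a sign.
same_sign = positive**2 + negative**2

print(positive, negative, same_sign)
```

Both halves carry mass $1/2$, so two independent samples agree in sign with probability $1/2$.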

There is much ongoing progress on the Sato-Tate conjecture; for $f \in H_k^*(N)$, non-CM, a proof of the conjecture has been announced by Barnet-Lamb, Geraghty, Harris and Taylor [1, Th. B]. However, knowing its truth does not immediately simplify our arguments. Indeed, it would be most immediately relevant for parts of Section 4, but Theorem 4 is really concerned with the pair-Sato-Tate conjecture, or property, which would be the statement, for a pair $(f_1, f_2)$, that for any $a_1 < b_1$, $a_2 < b_2$ the set of primes

$$\{p \mid \lambda_{f_1}(p) \in [a_1, b_1] \text{ and } \lambda_{f_2}(p) \in [a_2, b_2]\}$$

has density equal to $\mu_{ST}([a_1, b_1])\,\mu_{ST}([a_2, b_2])$ (in other words, the Fourier coefficients at primes are independently Sato-Tate distributed). This is expected to hold for any pair of non-CM modular forms, such that neither is a quadratic twist of the other (and in the case of elliptic curves, Mazur [14, Footnote 12] mentions that there is ongoing progress by Harris on this problem). If this holds, it will follow that the density of coincidences of signs is always $\leq \frac{1}{2}$, which is the probability under independent Sato-Tate measures that two "samples" are of the same sign.

Here is a natural question which is suggested by Theorem 4: estimate the size, as a function of the weight and conductor, of the smallest integer $n_{f_1,f_2}$ for which the signs of $\lambda_{f_1}(n)$ and $\lambda_{f_2}(n)$ are different. If we enlarge slightly our setting to allow $f_2$ to be an Eisenstein series such as the Eisenstein series of weight 4:

$$E_4(z) = 1 + 240\sum_{n \geq 1}\Big(\sum_{d \mid n} d^3\Big) e(nz),$$

where all Hecke eigenvalues are positive, then the question becomes (once more) that of finding the first negative Hecke eigenvalue for $f_1$, i.e., the problem considered in Theorem 1. Hence, we know that

$$n_{f_1,E_4} \ll (k_1^2N_1)^{9/20},$$

where the implied constant is absolute, but it would be interesting to obtain a more general version, in particular a uniform one with respect to both $f_1$ and $f_2$.

At least our statistic result (Theorem 2) generalizes immediately if one of the forms is fixed: taking $\mathscr{P}$ to be the set of all primes, $\nu = 1$ and $\varepsilon_p = \operatorname{sign}\lambda_{f_2}(p)$ if $\lambda_{f_2}(p) \neq 0$, and 1 otherwise, we get immediately the following corollary:

Corollary 3. Let $k_1, k_2 \geq 2$ be even integers and $N_1, N_2 \geq 1$ squarefree. For any fixed $f_2 \in H_{k_2}^*(N_2)$, there is an absolute positive constant $c$ such that

$$n_{f_1,f_2} \ll_{f_2} \log(k_1N_1),$$

for all $f_1 \in H_{k_1}^*(N_1)$ except for those in an exceptional set with

$$\ll k_1N_1 \exp\Big(-c\,\frac{\log k_1N_1}{\log\log k_1N_1}\Big)$$

elements.

2. Proof of Theorem 1

Although some ideas are related to those of Iwaniec, Kohnen and Sengupta, the 
proof of Theorem 1 is somewhat easier. 

Thus let $f \in H_k^*(N)$, and let $y > 0$ be such that $\lambda_f(n) \geq 0$ for $n \leq y$ and $(n, N) = 1$. The idea to estimate $y$ is to compare upper and lower bounds for the sum

$$S(f, x) := \sum^{\flat}_{\substack{n \leq x \\ (n,N)=1}} \lambda_f(n),$$

where $\sum^{\flat}$ restricts a sum to squarefree integers. An upper bound is easily achieved: using the convexity bound for Hecke $L$-functions and the Perron formula, we obtain

(2.1) $$S(f, x) \ll_\varepsilon (k^2N)^{1/4+\varepsilon} x^{1/2+\varepsilon} \qquad (x \geq 1).$$

This estimate is independent of any information on $y$. We note that, more generally, if we have

$$L(f, 1/2 + it) \ll (k^2N(1 + |t|)^2)^{\eta},$$

for $t \in \mathbf{R}$, where $\eta > 0$, then we get (2.1) for $x \geq Q^{2\eta+\varepsilon}$ (the recent work of Michel and Venkatesh [15] provides such a uniform result for some, very small, $\eta < 1/4$, and the convexity bound states that any $\eta > 1/4$ is suitable).

We now proceed to establish a lower bound for $S(f, x)$ by using the assumption. For primes $p \leq y$ with $p \nmid N$, we thus have $\lambda_f(p) \geq 0$; furthermore, if $p \leq \sqrt{y}$ and $p \nmid N$, we have the better bound $\lambda_f(p) \geq 1$ because

$$\lambda_f(p)^2 = 1 + \lambda_f(p^2) \geq 1$$

by the Hecke relation.

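We now introduce an auxiliary multiplicative function $h = h_y$ defined by

$$h_y(p) = \begin{cases} -2 & \text{if } p > y \text{ and } p \nmid N, \\ 0 & \text{if } \sqrt{y} < p \leq y \text{ or } p \mid N, \\ 1 & \text{if } p \leq \sqrt{y} \text{ and } p \nmid N, \end{cases}$$

and $h_y(p^\nu) = 0$ for $\nu \geq 2$.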
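
To make the definition concrete, here is a small sketch that evaluates $h_y(n)$ by trial division, following the three cases above ($h_y(p) = -2$ for $p > y$, $p \nmid N$; $0$ for $\sqrt{y} < p \leq y$ or $p \mid N$; $1$ for $p \leq \sqrt{y}$, $p \nmid N$; and $0$ on proper prime powers). The function name `h_y` and the sample values $y = 16$, $N \in \{1, 3\}$ are illustrative assumptions only.

```python
def h_y(n, y, N=1):
    """Evaluate the multiplicative function h_y at n >= 1 by trial division."""
    value = 1
    p = 2
    while n > 1:
        if p * p > n:
            p = n  # the remaining cofactor is prime
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # h_y(p^nu) = 0 for nu >= 2
            if N % p == 0 or y**0.5 < p <= y:
                return 0  # p | N, or sqrt(y) < p <= y
            value *= -2 if p > y else 1
        else:
            p += 1
    return value


# With y = 16: primes 2, 3 give 1; primes 5, ..., 13 give 0; primes > 16 give -2.
samples = {n: h_y(n, 16) for n in (1, 4, 5, 6, 17, 34, 17 * 19)}
print(samples)
```

Note that a value such as $h_y(17 \cdot 19) = 4$ only occurs for $n > y^2$; in the range $n \leq y^u$ with $u \leq 3/2$ relevant to Lemma 2.1, at most one prime factor can exceed $y$.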

We shall show in a moment the following lemma: 

Lemma 2.1. For any $y > 0$, define $h_y$ as above. Then, for any $\varepsilon > 0$, we have

(2.2) $$\sum_{n \leq y^u} h_y(n) = \frac{\varphi(N)}{\zeta_N(2) N}\, y^u \big(\rho(2u) - 2\log u\big)\big\{1 + O_\varepsilon(\cdots)\big\}$$

uniformly for

(2.3) $$1 \leq u \leq \tfrac{3}{2} \quad \text{and} \quad y \geq N^{1/3},$$

where

$$\zeta_N(2) = \prod_{p \nmid N} (1 - p^{-2})^{-1}$$

and $\rho(u)$ is the Dickman function, defined as the unique continuous solution of the difference-differential equation

$$u\rho'(u) + \rho(u - 1) = 0 \quad (u > 1), \qquad \rho(u) = 1 \quad (0 < u \leq 1).$$

In particular $\rho(2u) - 2\log u > 0$ for all $u < \kappa$ where $\kappa$ is the solution to $\rho(2\kappa) = 2\log\kappa$. We have $\kappa > \frac{10}{9}$.
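
The numerical size of $\kappa$ can be sketched as follows. This is not the authors' Maple computation, only an independent illustration under stated assumptions: it uses the classical closed form $\rho(t) = 1 - \log t + \int_2^t \frac{\log(v-1)}{v}\,dv$ for $2 \leq t \leq 3$ (the same expression quoted in Remark 1), Simpson's rule for the integral, and bisection for the root of $F(u) = \rho(2u) - 2\log u$ on $[1, 3/2]$. The names `rho`, `F` and the step counts are ours.

```python
import math


def rho(t, steps=2000):
    """Dickman function for 1 <= t <= 3 (closed form on [1, 2], integral on [2, 3])."""
    if t <= 2:
        return 1.0 - math.log(t)
    # Simpson's rule (steps is even) for the integral of log(v - 1) / v over [2, t].
    h = (t - 2.0) / steps

    def g(v):
        return math.log(v - 1.0) / v

    s = g(2.0) + g(t)
    s += sum((4 if i % 2 else 2) * g(2.0 + i * h) for i in range(1, steps))
    return 1.0 - math.log(t) + s * h / 3.0


def F(u):
    return rho(2 * u) - 2 * math.log(u)


# F(1) = 1 - log 2 > 0 and F(3/2) < 0, so bisection locates the sign change.
lo, hi = 1.0, 1.5
for _ in range(50):
    mid = (lo + hi) / 2
    if F(mid) > 0:
        lo = mid
    else:
        hi = mid
kappa = lo

print(kappa, 1 / (2 * kappa))
```

The second printed number is $1/(2\kappa)$, which is the exponent that the comparison of upper and lower bounds for $S(f, y^u)$ produces.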

The point of introducing this auxiliary function is that, with notation as before, we have the lower bound

(2.4) $$S(f, y^u) \geq \sum_{n \leq y^u} h_y(n)$$

for all $u < \kappa$, provided $y$ is large enough, e.g., $y \geq N^{1/3}$ with $N$ large enough, which we can obviously assume in proving Theorem 1.

Indeed, let $g_y$ be the multiplicative function defined by the Dirichlet convolution identity

$$\lambda_f = g_y * h_y,$$

then $g_y(n) \geq 0$ for all squarefree integers $n \geq 1$ such that $(n, N) = 1$, since

$$g_y(p) = \lambda_f(p) - h_y(p) \geq 0$$

for all $p \nmid N$ (the case $p > y$ following from Deligne's inequality). We have trivially

$$\sum_{\ell \leq z} h_y(\ell) \geq 0$$

if $z \leq y$ since each term is non-negative in that range, and additionally

$$\sum_{\ell \leq y^u} h_y(\ell) \geq 0$$

for $u < \kappa$ if $y$ is large enough, by Lemma 2.1.
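Hence

$$S(f, y^u) = \sum^{\flat}_{\substack{n \leq y^u \\ (n,N)=1}} \lambda_f(n) = \sum_{d \leq y^u} g_y(d) \sum_{\ell \leq y^u/d} h_y(\ell) \geq \sum_{\ell \leq y^u} h_y(\ell)$$

since every term in the sum over $d$ is non-negative and $g_y(1) = 1$.
We now deduce from Lemma 2.1

$$S(f, y^u) \geq \sum_{n \leq y^u} h_y(n) \gg \frac{\varphi(N)}{N}\, y^u \gg \frac{y^u}{\log\log N}$$

for $u < \kappa$. Then, a comparison with (2.1) gives the estimate

$$y \ll (k^2N)^{1/(2\kappa) + o(1)}.$$

Quoting the lower bound for $\kappa$ from Lemma 2.1, we are done.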
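Proof of Lemma 2.1. According to the definition of $h_y$, we have

(2.5) $$\sum_{n \leq y^u} h_y(n) = \sum^{\flat}_{\substack{n \leq y^u \\ P^+(n) \leq \sqrt{y}}} 1 \;-\; 2\sum_{y < p \leq y^u}\ \sum^{\flat}_{m \leq y^u/p} 1$$

for all $u$ and $y$ satisfying (2.3), where we use the convention here that $\sum^{\flat}$ also restricts the sum to $(n, N) = 1$ to simplify notation.
The second term contributes

$$\frac{\varphi(N)}{\zeta_N(2) N}\, y^u\, (2\log u)\big\{1 + O(\cdots)\big\}$$

by standard estimates (the leading constant arises of course because

$$\sum^{\flat}_{\substack{n \leq x \\ (n,N)=1}} 1 \sim \frac{\varphi(N)}{\zeta_N(2) N}\, x$$

uniformly in our range).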

For the first term, if it were not for this condition and the requirement that $n$ be squarefree, the lemma would then follow immediately from the well-known property

$$\sum_{\substack{n \leq y^u \\ P^+(n) \leq \sqrt{y}}} 1 = \rho(2u)\, y^u \Big\{1 + O\Big(\frac{1}{\log y}\Big)\Big\},$$

of the Dickman function. However, because our uniformity requirements are quite modest, it is fairly simple to deduce the stated inequality from this result using Möbius inversion to detect the coprimality and squarefree condition (for very general bounds of this type, see [26, Th. 2.1], though our requirements are much weaker).
A numerical computation using Maple leads to $\kappa > \frac{10}{9}$. □

Remark 1. To estimate $\kappa$, one can also use the lower bound

$$\rho(2u) = 1 - \log(2u) + \int_2^{2u} \frac{\log(t - 1)}{t}\, dt \geq 1 - \log(2u) \qquad (1 \leq u \leq \tfrac{3}{2}),$$

which shows that $\rho(2u) - 2\log u \geq 1 - \log 2 - 3\log u$, and

$$\kappa \geq (e/2)^{1/3} > \tfrac{11}{10}$$

which leads to the exponent $\frac{5}{11} = 0.4545\ldots$ in Theorem 1.

Remark 2. This result is not the limit of the method employed. Precisely, in addition to obtaining $\lambda_f(p) \geq 1$ for $p \leq \sqrt{y}$, $(p, N) = 1$, we can exploit higher powers: write $\lambda_f(p) = 2\cos\theta_f(p)$ with $\theta_f(p) \in [0, \pi]$. Then if $m \geq 1$ is an integer, we have for $1 \leq j \leq m$ and $y^{1/(m+1)} \leq p < y^{1/m}$ that

$$\lambda_f(p^j) = \frac{\sin((j+1)\theta_f(p))}{\sin\theta_f(p)} \geq 0$$

(if $p \nmid N$). This implies that $\theta_f(p) \leq \pi/(m+1)$, and hence

$$\lambda_f(p) \geq 2\cos\frac{\pi}{m+1},$$

for $p \nmid N$ with $y^{1/(m+1)} \leq p < y^{1/m}$. This can be exploited by bounding $S(f, y)$ from below using a new auxiliary function $h$ supported on squarefree numbers coprime to $N$ with

$$h(p) = \alpha\Big(\frac{\log p}{\log y}\Big)$$

where $\alpha(u) = -2$ for $u \geq 1$, $\alpha(0) = 2$ and $\alpha(u) = 2\cos(\pi/(m+1))$ if $1/(m+1) \leq u < 1/m$. One can show an asymptotic of the type

$$\sum_{n \leq y^u} h(n) \sim C\beta(u)\, y^u (\log y^u)$$

(assuming $N = 1$ for simplicity) for some constant $C > 0$, where the function $\beta$ can be described by the following inclusion-exclusion formula:

(— IV 



with 



dt\ ■ ■ ■ dtj 



/»= f (u-t, tj)^ -a(tj)) 



1=1 J 

integration ranging over the set

$$\Delta_j = \{(t_1, \ldots, t_j) \in [0, +\infty)^j \mid t_1 + \cdots + t_j \leq u\}.$$

This function is also a solution of the integral equation

$$u^2\beta(u) = \int_0^u t\beta(t)\,\alpha(u - t)\, dt$$

(see [5] for related investigations of a class of integral equations of this type).

To improve Theorem 1, one needs to find (a close approximation to) the first positive zero of $\beta$. We have not found a nice way to compute $\beta$ numerically, but this would be quite an interesting problem, and its solution is likely to lead to significant improvements in the result. We hope to come back to this in the future.

3. Statistical results 

Our goal is now to prove Theorems 2 and 3. For the first, the main tool is the 
following type of large sieve inequality. 

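Lemma 3.1 ([13], Theorem 1). Let $\nu \geq 1$ be a fixed integer and let $\{b_p\}_p$ be a sequence of real numbers indexed by prime numbers such that $|b_p| \leq B$ for some constant $B$ and for all primes $p$. Then we have

$$\sum_{f \in H_k^*(N)} \Big|\sum_{\substack{P < p \leq Q \\ p \nmid N}} b_p \frac{\lambda_f(p^\nu)}{p}\Big|^{2j} \ll_\nu k\varphi(N)\Big(\frac{96B^2(\nu+1)^2 j}{P\log P}\Big)^{j} + (kN)^{10/11}\Big(\frac{10BQ^{\nu/10}}{\log P}\Big)^{2j}$$

uniformly for

$$B > 0, \quad j \geq 1, \quad 2 \mid k, \quad 2 \leq P < Q \leq 2P, \quad N \geq 1 \ (\text{squarefree}).$$

The implied constant depends on $\nu$ only.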

Proof of Theorem 2. The basic idea is that for all forms $f$ with coefficients $\lambda_f(p^\nu)$ of the same sign $\varepsilon_p$, the sums

$$\sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}}} \varepsilon_p \frac{\lambda_f(p^\nu)}{p}$$

exhibit no cancellation due to variation of signs. The large sieve implies this is very unlikely to happen, except if the $\lambda_f(p^\nu)$ are very small in absolute value. The Hecke relations are used to control this other possibility by relating it to $\lambda_f(p^{2\nu})$ being large, which cannot happen too often either.$^{3}$
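For the details, we first denote

$$\mathscr{P}_N := \{p \in \mathscr{P} \mid p \nmid N\},$$

and define

$$\mathscr{E}_k^{\nu}(N, P; \mathscr{P}) := \{f \in H_k^*(N) \mid \varepsilon_p\lambda_f(p^\nu) > 0 \text{ for } p \in \mathscr{P}_N \cap (P, 2P]\},$$

$$\mathscr{F}_k^{\nu'}(N, P; \mathscr{P}) := \Big\{f \in H_k^*(N) \ \Big|\ \Big|\sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}_N}} \frac{\lambda_f(p^{2\nu'})}{p}\Big| \geq \frac{\delta}{2\nu\log P}\Big\} \qquad (1 \leq \nu' \leq \nu).$$

To prove Theorem 2, clearly we only need to show that there are two positive constants $C = C(\nu, \mathscr{P})$ and $c = c(\nu, \mathscr{P})$ such that

(3.1) $$|\mathscr{E}_k^{\nu}(N, P; \mathscr{P})| \ll_{\nu, \mathscr{P}} kN\exp\Big(-c\,\frac{\log kN}{\log\log kN}\Big)$$

uniformly for

$$2 \mid k, \quad N \ (\text{squarefree}), \quad kN \geq X_0, \quad C\log(kN) \leq P \leq (\log(kN))^{10}$$

for some sufficiently large number $X_0 = X_0(\nu, \mathscr{P})$.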
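The definition of $\mathscr{E}_k^{\nu}(N, P; \mathscr{P})$ and Deligne's inequality allow us to write

$$\sum_{f \in \mathscr{E}_k^{\nu}(N,P;\mathscr{P})} \Big|\sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}_N}} \frac{\lambda_f(p^\nu)^2}{p}\Big|^{2j} \leq \sum_{f \in \mathscr{E}_k^{\nu}(N,P;\mathscr{P})} \Big|\sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}_N}} (\nu+1)\varepsilon_p\frac{\lambda_f(p^\nu)}{p}\Big|^{2j} \leq \sum_{f \in H_k^*(N)} \Big|\sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}_N}} (\nu+1)\varepsilon_p\frac{\lambda_f(p^\nu)}{p}\Big|^{2j}.$$

Choosing $B = \nu + 1$, $Q = 2P$ and

$$b_p = \begin{cases} (\nu+1)\varepsilon_p & \text{if } p \in \mathscr{P}, \\ 0 & \text{otherwise} \end{cases}$$

in Lemma 3.1, we find that

(3.2) $$\sum_{f \in \mathscr{E}_k^{\nu}(N,P;\mathscr{P})} \Big|\sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}_N}} \frac{\lambda_f(p^\nu)^2}{p}\Big|^{2j} \leq \sum_{f \in H_k^*(N)} \Big|\sum_{\substack{P < p \leq 2P \\ p \nmid N}} b_p\frac{\lambda_f(p^\nu)}{p}\Big|^{2j} \ll kN\Big(\frac{96(\nu+1)^4 j}{P\log P}\Big)^{j} + (kN)^{10/11}P^{\nu j/5}.$$

$^{3}$ Variants of this well-known trick have been used in a number of other contexts, as in [3], but note that the large sieve inequality proved there would not work for this problem, due to the lack of multiplicative stability of the sign conditions (it would also be much less efficient).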
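In view of the Hecke relation (1.1), the left-hand side of (3.2) is

$$= \sum_{f \in \mathscr{E}_k^{\nu}(N,P;\mathscr{P})} \Big|\sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}_N}} \frac{1}{p}\sum_{0 \leq \nu' \leq \nu} \lambda_f(p^{2\nu'})\Big|^{2j} \geq \sum_{f \in \mathscr{E}_k^{\nu}(N,P;\mathscr{P}) \setminus (\bigcup_{\nu'=1}^{\nu} \mathscr{F}_k^{\nu'}(N,P;\mathscr{P}))} \Big(\sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}_N}} \frac{1}{p} - \frac{\delta}{2\log P}\Big)^{2j}.$$

Let $\omega(n)$ be the number of distinct prime factors of $n$. Using the hypothesis on $\mathscr{P}$ and the classical inequality

$$\omega(n) \leq \{1 + o(1)\}\frac{\log n}{\log\log n},$$

we infer that

$$\sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}_N}} \frac{1}{p} - \frac{\delta}{2\log P} \geq \sum_{\substack{P < p \leq 2P \\ p \in \mathscr{P}}} \frac{1}{p} - \sum_{\substack{p \mid N \\ p > P}} \frac{1}{p} - \frac{\delta}{2\log P} \geq \frac{\delta}{2\log P} - \frac{\omega(N)}{P} \geq \frac{\delta/2 - 2/C}{\log P} \geq \frac{\delta}{6\log P},$$

provided $C \geq 6/\delta$. Combining this with (3.2), we infer that

$$\Big|\mathscr{E}_k^{\nu}(N,P;\mathscr{P}) \setminus \Big(\bigcup_{\nu'=1}^{\nu} \mathscr{F}_k^{\nu'}(N,P;\mathscr{P})\Big)\Big| \ll kN\Big(\frac{3456(\nu+1)^4 j\log P}{\delta^2 P}\Big)^{j} + (kN)^{10/11}P^{\nu j}.$$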

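Now we bound the size of the sets $\mathscr{F}_k^{\nu'}(N, P; \mathscr{P})$ to finish the proof. Taking

$$B = 1, \quad \nu = 2\nu', \quad Q = 2P$$

and

$$b_p = \begin{cases} 1 & \text{if } p \in \mathscr{P}, \\ 0 & \text{otherwise} \end{cases}$$

in Lemma 3.1, we get

$$\sum_{f \in \mathscr{F}_k^{\nu'}(N,P;\mathscr{P})} \Big(\frac{\delta}{2\nu\log P}\Big)^{2j} \leq \sum_{f \in H_k^*(N)} \Big|\sum_{\substack{P < p \leq 2P \\ p \nmid N}} b_p\frac{\lambda_f(p^{2\nu'})}{p}\Big|^{2j} \ll kN\Big(\frac{96(2\nu'+1)^2 j}{P\log P}\Big)^{j} + (kN)^{10/11}\Big(\frac{10(2P)^{\nu'/5}}{\log P}\Big)^{2j}.$$

Hence,

(3.3) $$|\mathscr{F}_k^{\nu'}(N,P;\mathscr{P})| \ll kN\Big(\frac{3456\nu^4 j\log P}{\delta^2 P}\Big)^{j} + (kN)^{10/11}P^{\nu j} \qquad (1 \leq \nu' \leq \nu)$$

provided $P \geq 2(20\nu/\delta)^{10/(3\nu)}$.

Combining this with (3.3), we finally obtain

(3.4) $$|\mathscr{E}_k^{\nu}(N,P;\mathscr{P})| \ll kN\Big(\frac{3456(\nu+1)^4 j\log P}{\delta^2 P}\Big)^{j} + (kN)^{10/11}P^{\nu j}$$

uniformly for

$$2 \mid k, \quad N \ (\text{squarefree}), \quad C\log(kN) \leq P \leq (\log(kN))^{10}, \quad j \geq 1.$$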
Now, take

$$j = \Big[\delta^*\frac{\log(kN)}{\log P}\Big]$$

where $\delta^* = \delta^2/(10(\nu+1))^4$. We can ensure $j \geq 1$ once $X_0$ is chosen to be suitably large. A simple computation gives that

$$\Big(\frac{3456(\nu+1)^4 j\log P}{\delta^2 P}\Big)^{j} \ll \exp\Big(-c\,\frac{\log kN}{\log\log kN}\Big)$$

for some positive constant $c = c(\nu, \mathscr{P})$ and $P^{\nu j} \ll (kN)^{1/1000}$, provided $X_0$ is large enough. Inserting them into (3.4), we get (3.1) and complete the proof. □

We now come to the lower bound of Theorem 3. Our basic tool here is an equidistribution theorem for Hecke eigenvalues which is of some independent interest: it shows (quantitatively) that, after suitable average over $H_k^*(N)$, the Hecke eigenvalues corresponding to the first primes are independently Sato-Tate distributed (thus, it is related to the earlier work of Sarnak [20] for Maass forms and Serre [21] and Royer [19] for holomorphic forms).

First, we recall the definition (1.4) of the angle $\theta_f(p) \in [0, \pi]$ associated to any $f \in H_k^*(N)$ and prime $p \nmid N$. We also recall that the Chebychev functions $X_n$, $n \geq 0$, defined by

(3.5) $$X_n(\theta) = \frac{\sin((n+1)\theta)}{\sin\theta}$$

for $\theta \in [0, \pi]$, form an orthonormal basis of $L^2([0, \pi], \mu_{ST})$. Hence, for any $\omega \geq 1$, the functions of the type

$$(\theta_1, \ldots, \theta_\omega) \mapsto \prod_{1 \leq j \leq \omega} X_{n_j}(\theta_j)$$

for $n_j \geq 0$, form an orthonormal basis of $L^2([0, \pi]^\omega, \mu_{ST}^{\otimes\omega})$.
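
The orthonormality of the Chebychev functions with respect to the Sato-Tate measure can be checked numerically. The sketch below is an illustration only: it takes $X_n(\theta) = \sin((n+1)\theta)/\sin\theta$ and $d\mu_{ST} = \frac{2}{\pi}\sin^2\theta\, d\theta$, and approximates the Gram matrix of $X_0, \ldots, X_3$ by the midpoint rule; the helper names `X` and `inner` and the step count are our own choices.

```python
import math


def X(n, theta):
    """Chebychev function X_n(theta) = sin((n + 1) theta) / sin(theta)."""
    return math.sin((n + 1) * theta) / math.sin(theta)


def inner(m, n, steps=20_000):
    """Midpoint-rule value of the integral of X_m X_n against the Sato-Tate measure."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h  # midpoints avoid the removable singularities at 0 and pi
        total += X(m, t) * X(n, t) * math.sin(t) ** 2
    return 2.0 / math.pi * total * h


# Gram matrix of X_0, ..., X_3; it should be close to the identity matrix.
gram = [[inner(m, n) for n in range(4)] for m in range(4)]
for row in gram:
    print([round(v, 6) for v in row])
```

In particular the constant term of an expansion $\sum_j \hat{y}(j) X_j$ is just its integral against $\mu_{ST}$, which is how constant terms are extracted in what follows.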

Proposition 1. Let $N$ be a squarefree number, $k \geq 2$ an even integer, $s \geq 1$ an integer and $z \geq 2$ a real number. For any prime $p \leq z$ coprime with $N$, let

$$Y_p(\theta) = \sum_{j=0}^{s} \hat{y}_p(j) X_j(\theta)$$

be a "polynomial" of degree $\leq s$ expressed in the basis of Chebychev functions on $[0, \pi]$. Then we have

$$\sum_{f \in H_k^*(N)} \omega_f \prod_{\substack{p \leq z \\ (p,N)=1}} Y_p(\theta_f(p)) = \prod_{\substack{p \leq z \\ (p,N)=1}} \hat{y}_p(0) + O\big(C^{\pi(z)} D^{sz} (\tau(N)\log 2N)^2 (Nk^{5/6})^{-1}\big)$$

where

$$\omega_f = \frac{\Gamma(k-1)\, N}{(4\pi)^{k-1}\langle f, f\rangle\,\varphi(N)}, \qquad \langle f, f\rangle \text{ the Petersson norm of } f,$$

$$C = \max_{p,j} |\hat{y}_p(j)|,$$

and $D \geq 1$ and the implied constant are absolute.
By linearity, clearly, we get an analogous result for

$$\sum_{f \in H_k^*(N)} \omega_f \sum_j \psi_j(\theta_f)$$

where each $\psi_j$ is a function which is a product of polynomials as in the statement.
Proof. Using the fact that for any $n_p \geq 0$, we have

(3.6) $$\prod_{\substack{p \leq z \\ (p,N)=1}} X_{n_p}(\theta_f(p)) = \lambda_f\Big(\prod_{\substack{p \leq z \\ (p,N)=1}} p^{n_p}\Big),$$

(which is another form of the Hecke multiplicativity), we expand the product and get

$$\prod_{\substack{p \leq z \\ (p,N)=1}} Y_p(\theta_f(p)) = \sum_{d \mid P_N(z)^s} \Big(\prod_{p \mid P_N(z)} \hat{y}_p(v_p(d))\Big)\lambda_f(d)$$

where $v_p(d)$ is the $p$-adic valuation of an integer and $P_N(z)$ is the product of the primes $p \leq z$, $p \nmid N$.

We now sum over $f$ and appeal to the following Petersson formula for primitive forms:

$$\sum_{f \in H_k^*(N)} \omega_f\lambda_f(m) = \delta(m, 1) + O\big(m^{1/4}\tau(N)^2(\log 2mN)^2(Nk^{5/6})^{-1}\big),$$

for all $m \geq 1$ coprime with $N$ (this is a simplified version of that in [8, Cor. 2.10]; note our slightly different definition of $\omega_f$, which explains the absence of $\varphi(N)/N$ on the right-hand side); the result then follows easily from simple estimates for the sum over $d$ of the remainder terms. □

We now deduce Theorem 3 from this, assuming $\varepsilon_p = 1$ for all $p$ (handling the other choices of signs being merely a matter of complicating the notation).

To simplify notation, we write $P = P_N(z)$ for the product of primes $\leq z$ coprime with $N$, and $\omega$ for the number of such primes.

First, if we wanted only to have $\lambda_f(p) \geq 0$ for a fixed (finite) set of primes (i.e., for $z$ fixed), we would be immediately done: Proposition 1 shows$^{4}$ that the $(\theta_f(p))_{p \mid P}$ become equidistributed as $kN \to +\infty$ with respect to the product Sato-Tate measure, if we weigh modular forms with $\omega_f$, and hence

$$\sum_{\substack{f \in H_k^*(N) \\ p \mid P \Rightarrow \lambda_f(p) \geq 0}} \omega_f \longrightarrow \Big(\frac{1}{2}\Big)^{\omega}$$

which is of the desired type, except for the presence of the weight.$^{5}$ However, we want to have

$$\lambda_f(p) \geq 0 \quad \text{for } p \leq z,\ (p, N) = 1,$$

where $z$ grows with $kN$, and this involves quantitative lower bounds for approximation in large dimension, which requires more care. We use a result of Barton, Montgomery and Vaaler [2] for this purpose; although it is optimized for uniform distribution modulo 1 instead of the Sato-Tate context, it is not difficult to adapt it here and this gives a quick and clean argument.$^{6}$

$^{4}$ For Maass forms, this is essentially one of the early results of Sarnak [20].

$^{5}$ Using the trace formula instead of the Petersson formula (as in [19]), the unweighted analogue of Proposition 1 holds with a product of local Plancherel measures, but each still gives measure 1/2 to the two signs.

Precisely, we consider $[0, \pi]^\omega$, with the product Sato-Tate measure, and we will write $\theta = (\theta_p)$ for the elements of this set; we also consider $[0, 1]^\omega$ and we write $x = (x_p)$ for elements there.

For any positive odd integer $L$, we get from [2, Th. 7] two explicit trigonometric polynomials$^{7}$ on $[0, 1]^\omega$, denoted $A_L(x)$, $B_L(x)$, such that

$$A_L(\theta/\pi) - B_L(\theta/\pi) \leq \prod_{\substack{p \leq z \\ (p,N)=1}} \chi(\theta_p)$$

for all $\theta = (\theta_p) \in [0, \pi]^\omega$, where $\chi(\theta_p)$ is the characteristic function of $[0, \pi/2] \subset [0, \pi]$ (precisely, we consider the functions denoted $A(x)$, $B(x)$ in [2], with parameters $N = \omega$ and $u_n = 0$, $v_n = 1/2$ for all $n \leq \omega$; since $(v_n - u_n)(L + 1) = (L + 1)/2$ is a positive integer, we are in the situation $\Phi_{u,v} \in \mathscr{B}_N(L)$ of loc. cit.).
Thus we have the lower bound

(3.7) $$\sum_{\substack{f \in H_k^*(N) \\ \lambda_f(p) \geq 0 \text{ for } p \mid P}} \omega_f \geq \sum_{f \in H_k^*(N)} \omega_f\big(A_L(\theta_f/\pi) - B_L(\theta_f/\pi)\big),$$

where $\theta_f = (\theta_f(p))_p$.

Moreover, as we will explain below, $A_L(\theta/\pi)$ is a product of polynomials over each variable, and $B_L(\theta/\pi)$ is a sum of $\omega$ such products, and we can now apply Proposition 1 (and the remark following it) to the terms on the right-hand side. More precisely, we claim that the following lemma holds:

Lemma 3.2. With notation as above, we have:

(1) For any $\varepsilon \in (0, 1/2)$, there exist constants $L_0 \geq 1$ and $c > 0$, such that the contribution $\Delta$ of the constant terms of the Chebychev expansions of $A_L(\theta/\pi)$ and $B_L(\theta/\pi)$ satisfies

$$\Delta \geq \Big(\frac{1}{2} - \varepsilon\Big)^{\pi(z)}$$

if $L$ is the smallest odd integer $\geq c\pi(z)$ and if $L \geq L_0$.

(2) All the coefficients in the expansion in terms of Chebychev functions of the factors in $A_L(\theta/\pi)$ or in the terms of $B_L(\theta/\pi)$ are bounded by 1.

(3) The degrees, in terms of Chebychev functions, of the factors of $A_L(\theta/\pi)$ and of the terms of $B_L(\theta/\pi)$, are $\leq 2L$.

Using this lemma, fixing e G (0, 1/2) and taking L as in Part (1) (we can obviously 
assume L ^ L , since otherwise z is bounded) we derive from Proposition 1 that 

Uf{M9fh)-BL{9f/*)) =A + 0(D z ^ z \r(N)\og2N) 2 (Nk 5 / 6 )- 1 ) 

/eH*(7V) 

6 This result was also used recently by Y. Lamzouri [11, §7], in a somewhat related context. 

7 Meaning, standard trigonometric polynomials of the type J2e a ^ e (^ ' x )- 
for some absolute constant $D$, with $\Delta \ge (1/2 - \varepsilon)^{\pi(z)}$. This is then $\gg (1/2 - \varepsilon)^{\pi(z)}$,
provided

$$D^{z\pi(z)} (\tau(N)\log 2N)^2 (Nk^{5/6})^{-1} \ll (1/2 - \varepsilon)^{\pi(z)}.$$

This condition is satisfied for

$$z \le c\sqrt{(\log kN)(\log\log kN)}$$

where $c > 0$ is an absolute constant, and this gives Theorem 3 when counting with
the weight $\omega_f$. But, using well-known bounds for $\langle f, f \rangle$, we have

uj f < A;iV(logA;iV)(loglog6iV) < kN{logkN) 2 , 
with an absolute implied constant. Hence, for $z = c\sqrt{(\log kN)(\log\log kN)}$, we get

^jjIV e HJM | A,<ri S, for p < p t AT}| » - 

if kN is large enough, and so we obtain Theorem 3 as stated. 

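Proof of Lemma 3.2. We must now refer to the specific construction in [2]. We start
with $A_L(x)$: we have the product formula

$$A_L(x) = \prod_{p \mid P} \alpha_L(x_p),$$

where $\alpha_L$ is a trigonometric polynomial in one variable of degree $\le L$, i.e., of the
type

$$\alpha_L(x) = \sum_{|\ell| \le L} \hat{\alpha}_L(\ell)\, e(\ell x),$$

with $\hat{\alpha}_L(0) = 1/2$ (see [2, (2.2), Lemma 5, (2.17)]). In particular, the constant term
(in the Chebychev expansion) for $A_L(\theta/\pi)$ is given by

$$\Bigl(\frac{2}{\pi}\int_0^{\pi} \alpha_L(\theta/\pi)\sin^2\theta \, d\theta\Bigr)^{\omega},$$
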
and we will bound it below. For the moment, we observe further that, from [2,
Lemma 5], we know that $0 \le \alpha_L(x) \le 1$ for all $x \in [0, 1]$, and so we can simply
bound all the coefficients in the Chebychev expansion, using the Cauchy-Schwarz
inequality and orthonormality:

$$\Bigl|\int_0^{\pi} \alpha_L(\theta/\pi) X_n(\theta)\, d\mu_{ST}\Bigr|^2 \le \int_0^{\pi} |\alpha_L(\theta/\pi)|^2 \, d\mu_{ST} \times \int_0^{\pi} |X_n(\theta)|^2 \, d\mu_{ST} \le \int_0^{\pi} d\mu_{ST} \times \int_0^{\pi} |X_n(\theta)|^2 \, d\mu_{ST} = 1.$$

It is also clear using the definition of $X_n(\theta)$ that the $n$-th coefficient is zero as
soon as $n + 2 > 2L$.

We now come to $B_L(x)$, which is a sum of $\omega$ product functions, as already indicated:
we have

$$B_L(x) = \sum_{p \mid P} \beta_L(x_p) \prod_{\substack{q \mid P \\ q \ne p}} \alpha_L(x_q),$$

where $\beta_L(x)$ is another trigonometric polynomial of degree $L$, given explicitly by

$$\beta_L(x) = \frac{1}{2L + 2}\Bigl(2 + 2\sum_{\ell = 1}^{L}\Bigl(1 - \frac{\ell}{L + 1}\Bigr)\bigl(1 + (-1)^{\ell}\bigr)\cos(2\pi \ell x)\Bigr)$$

(see [2, p. 342, (2.3), p. 339]).

We now see immediately that Part (3) of the lemma is valid, and moreover, we
see that $|\beta_L(x)| \le 1$, so the same Cauchy-Schwarz argument already used for $\alpha_L$
implies that Part (2) holds.

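To conclude, we look at the constant term in the Chebychev expansion for $B_L$,
which is given by

$$\omega \Bigl(\frac{2}{\pi}\int_0^{\pi} \beta_L(\theta/\pi)\sin^2\theta \, d\theta\Bigr)\Bigl(\frac{2}{\pi}\int_0^{\pi} \alpha_L(\theta/\pi)\sin^2\theta \, d\theta\Bigr)^{\omega - 1}.$$

Using the expression

$$\beta_L(\theta/\pi) = \frac{1}{2L + 2}\Bigl(2 + 2\sum_{\ell = 1}^{L}\Bigl(1 - \frac{\ell}{L + 1}\Bigr)\bigl(1 + (-1)^{\ell}\bigr)\cos(2\ell\theta)\Bigr),$$

where the second term doesn't contribute after integrating against $\sin^2\theta = (1 - \cos 2\theta)/2$
(the term with $\ell = 1$ is zero), we get the formula

$$\Delta = \Bigl(\frac{2}{\pi}\int_0^{\pi} \alpha_L(\theta/\pi)\sin^2\theta \, d\theta\Bigr)^{\omega - 1}\Bigl(\frac{2}{\pi}\int_0^{\pi} \alpha_L(\theta/\pi)\sin^2\theta \, d\theta - \frac{\omega}{L + 1}\Bigr)$$

for the contribution of $A_L(x) - B_L(x)$.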
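The vanishing of the cosine terms can be checked numerically. The following is only a sketch: it assumes that $\beta_L(x) = \frac{1}{2L+2}\bigl(2 + 2\sum_{\ell=1}^{L}(1 - \frac{\ell}{L+1})(1 + (-1)^{\ell})\cos(2\pi\ell x)\bigr)$, and the helper names `beta_L` and `sato_tate_constant_term` are ours, not from [2]. Under that assumption the constant term of $\beta_L(\theta/\pi)$ against the Sato-Tate measure should equal $1/(L+1)$.

```python
import math

def beta_L(x, L):
    """Assumed form of beta_L (see the lead-in): a normalized sum of two Fejer kernels."""
    s = 2.0
    for l in range(1, L + 1):
        s += 2.0 * (1.0 - l / (L + 1)) * (1 + (-1) ** l) * math.cos(2 * math.pi * l * x)
    return s / (2 * L + 2)

def sato_tate_constant_term(g, steps=2000):
    """Midpoint-rule value of (2/pi) * integral over [0, pi] of g(theta/pi) * sin^2(theta)."""
    h = math.pi / steps
    total = 0.0
    for j in range(steps):
        theta = (j + 0.5) * h
        total += g(theta / math.pi) * math.sin(theta) ** 2
    return 2.0 / math.pi * total * h

if __name__ == "__main__":
    for L in (1, 3, 5, 11):
        value = sato_tate_constant_term(lambda x: beta_L(x, L))
        print(L, value, 1.0 / (L + 1))
```

The midpoint rule is used because it integrates trigonometric polynomials of small degree over a full period essentially exactly, so the comparison with $1/(L+1)$ is limited only by rounding.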

Now we come back to a lower bound for the constant term for $\alpha_L$. The point
is that, as $L \to +\infty$, $\alpha_L$ converges in $L^2([0, 1])$ to the characteristic function $\chi$ of
$[0, 1/2]$: from [2, (2.6)], and the definition of $\alpha_L$, we get

$$|\chi(x) - \alpha_L(x)| \le \beta_L(x), \qquad 0 \le x \le 1,$$

and from the Fourier expansion of $\beta_L$ we have

$$\|\beta_L\|_{L^2}^2 \le \frac{1}{(2L + 2)^2} \times (4L + 4) \to 0.$$

Hence, we know that

$$\frac{2}{\pi}\int_0^{\pi} \alpha_L(\theta/\pi)\sin^2\theta \, d\theta \to \int_0^{\pi} \chi(\theta/\pi)\, d\mu_{ST} = 1/2.$$

For given $\varepsilon \in (0, 1/2)$, the integral is $\ge 1/2 - \varepsilon/2$ if $L \ge L_0$, for some constant
$L_0$. Then, if $L + 1 \ge 2\varepsilon^{-1}\omega$, we derive

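$$\Delta \ge \Bigl(\frac{1}{2} - \frac{\varepsilon}{2}\Bigr)^{\omega - 1}\Bigl(\frac{1}{2} - \frac{\varepsilon}{2} - \frac{\varepsilon}{2}\Bigr) \ge \Bigl(\frac{1}{2} - \varepsilon\Bigr)^{\omega},$$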

which gives Part (1) of the lemma. □ 
4. Proof of Theorem 4

The simple idea of the proof of Theorem 4 is that the assumption translates to
$\lambda_{f_1}(p)\lambda_{f_2}(p) \ge 0$ for all primes $p$ (with few exceptions). However, it is well-known
from Rankin-Selberg theory that if $f_1 \ne f_2$, we have

(4.1) $$\sum_p \frac{\lambda_{f_1}(p)\lambda_{f_2}(p)}{p^{\sigma}} = O(1) \qquad (\sigma \to 1+)$$

(see, e.g., [7, §5.12] for a survey and references; the underlying fact about
automorphic forms is due to Mœglin and Waldspurger). Thus we only need to find a
lower bound for the left-hand side (which is a sum of non-negative terms) which is
unbounded as $\sigma$ tends to $1+$. Since Rankin-Selberg theory also gives

(4.2) $$\sum_p \frac{\lambda_{f_1}(p)^2}{p^{\sigma}} \sim -\log(\sigma - 1) \qquad (\sigma \to 1+),$$

the only difficulty is that one might fear that the coefficients of $f_1$ and $f_2$ are such
that whenever $\lambda_{f_1}(p)$ is not small, the value of $\lambda_{f_2}(p)$ is very small.⁸ In other words,
we must show that the smaller order of magnitude of (4.1) compared with (4.2) is
not due to the small size of the summands, but to sign compensations. For this we
use the following trick which exploits the little partial information known towards
the pair Sato-Tate conjecture.

Assume first that $f_1$ and $f_2$ are non-CM cusp forms, and that neither is a quadratic
twist of the other (in particular, $f_1 \ne f_2$). By Ramakrishnan's Theorem ([18, Th.
M, §3]), there exists a cuspidal automorphic representation $\pi$ on $GL(4)/\mathbb{Q}$ such that

$$L(\pi, s) = L(f_1 \times f_2, s),$$

and consequently, by Rankin-Selberg theory on $GL(4) \times GL(4)$ now (the fact that
$L(\pi \times \tilde{\pi}, s)$ has a single pole at $s = 1$), we have

(4.3) $$\sum_p \frac{(\lambda_{f_1}(p)\lambda_{f_2}(p))^2}{p^{\sigma}} = \sum_p \frac{1}{p^{\sigma}} + O(1).$$

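However, if we denote by $E$ the set of primes $p$ for which $\lambda_{f_1}(p)\lambda_{f_2}(p) < 0$, we have

$$\sum_p \frac{(\lambda_{f_1}(p)\lambda_{f_2}(p))^2}{p^{\sigma}} = \sum_{p \notin E} \frac{(\lambda_{f_1}(p)\lambda_{f_2}(p))^2}{p^{\sigma}} + \sum_{p \in E} \frac{(\lambda_{f_1}(p)\lambda_{f_2}(p))^2}{p^{\sigma}} \le 4\sum_{p \notin E} \frac{\lambda_{f_1}(p)\lambda_{f_2}(p)}{p^{\sigma}} + 16\sum_{p \in E} \frac{1}{p^{\sigma}}$$

by Deligne's bound. Then the first sum can also be written

$$4\sum_{p \notin E} \frac{\lambda_{f_1}(p)\lambda_{f_2}(p)}{p^{\sigma}} = -4\sum_{p \in E} \frac{\lambda_{f_1}(p)\lambda_{f_2}(p)}{p^{\sigma}} + 4\sum_p \frac{\lambda_{f_1}(p)\lambda_{f_2}(p)}{p^{\sigma}} \le 16\sum_{p \in E} \frac{1}{p^{\sigma}} + O(1),$$

using once more Deligne's bound and the assumption $f_1 \ne f_2$ to apply (4.1).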



8 See the remark after the proof for an example of which potential situations must be excluded. 
Comparing (4.3) with these two inequalities leads to

$$\sum_{p \in E} \frac{1}{p^{\sigma}} \ge \frac{1}{32}\sum_p \frac{1}{p^{\sigma}} + O(1),$$

i.e., the set of primes where the signs of $f_1$ and $f_2$ differ has analytic density $\ge 1/32$.

There remains to consider Part (1) of Theorem 4 when one of the forms is of CM
type (and the exceptional set $E$ now has density 0). We will be brief since there are
fewer difficulties here. The main point is the following well-known result concerning
the distribution of the angles $\theta_f(p)$ for a CM form $f \in H_k^*(N)$, with $k \ge 2$: there
exists a real, non-trivial, primitive Dirichlet character $\chi_f$ such that $\lambda_f(p) = 0$ when
$\chi_f(p) = -1$ (a set of primes $I_f$ of density $1/2$), and for $p \notin I_f$, the $\theta_f(p) \in [0, \pi]$ for
$p \le x$ become uniformly distributed as $x \to +\infty$, i.e., we have



7r(a;) 

for all non-zero integers $m \in \mathbb{Z}$ (see, e.g., [17, p. 197], where this is explained for
elliptic curves, with slightly different notation). In particular, for any $\alpha > 0$, the
density of the set of primes where $|\lambda_f(p)| > \alpha$ exists and is equal to

$$\frac{1}{\pi}\arccos(\alpha/2)$$

and this density goes to $1/2$ as $\alpha \to 0$.
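As a sanity check on this density, here is a small sketch under the model described above: half of the primes have $\lambda_f(p) = 0$, and on the other half $\lambda_f(p) = 2\cos\theta$ with $\theta$ uniform on $[0, \pi]$. The function names are ours, and the grid average is a stand-in for the limiting distribution, not a computation with an actual CM form.

```python
import math

def cm_density(alpha):
    """Closed form from the text: (1/pi) * arccos(alpha / 2), for 0 < alpha < 2."""
    return math.acos(alpha / 2.0) / math.pi

def grid_density(alpha, steps=200000):
    """Model: half the primes give 0; on the other half theta is uniform on [0, pi]."""
    hits = 0
    for j in range(steps):
        theta = (j + 0.5) * math.pi / steps
        if abs(2.0 * math.cos(theta)) > alpha:
            hits += 1
    return 0.5 * hits / steps

if __name__ == "__main__":
    for alpha in (0.1, 0.5, 1.0, 1.9):
        print(alpha, cm_density(alpha), grid_density(alpha))
```

The factor $1/2$ in `grid_density` accounts for the primes in $I_f$, where the coefficient vanishes and so never exceeds $\alpha$.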

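Now assume $f_1$ is a CM form and $f_2$ is not; according to Lemma 4.1 below, we
find $\alpha > 0$ and a set of primes $P_2$ of analytic density $\delta > 1/2$ where $|\lambda_{f_2}(p)| > \alpha$, and
then the set $P_2 \setminus I_{f_1}$ has analytic density $> 0$, thus for small enough $\alpha'$, it contains
a set $G$ with positive analytic density where $|\lambda_{f_1}(p)| > \alpha'$. Hence we have

$$\sum_p \frac{\lambda_{f_1}(p)\lambda_{f_2}(p)}{p^{\sigma}} \ge \sum_{p \in G} \frac{\lambda_{f_1}(p)\lambda_{f_2}(p)}{p^{\sigma}} + o(\log|\sigma - 1|^{-1}) \ge \alpha\alpha' \sum_{p \in G} \frac{1}{p^{\sigma}} + o(\log|\sigma - 1|^{-1}), \quad \text{as } \sigma \to 1+,$$

which is in fact a contradiction (since $f_1$ can not be equal to $f_2$).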

Finally, assume $f_1$ and $f_2$ are CM forms. Because of independence of primitive real
characters, the union $I_{f_1} \cup I_{f_2}$ has density at most $3/4$ (the complement contains the
set of primes totally split in a Galois extension of $\mathbb{Q}$ of degree at most 4). For small
enough $\alpha > 0$, the complement must contain a set of primes of positive analytic
density where $|\lambda_{f_1}(p)| > \alpha$, $|\lambda_{f_2}(p)| > \alpha$, and we can conclude as before that the
Rankin-Selberg convolution has a pole at $s = 1$, so that $f_1 = f_2$ in that case also.

Remark 3. In the first version of this paper, we did not use Ramakrishnan's theorem,
but managed to prove a weaker version of Part (1) of Theorem 4 using only the
Rankin-Selberg properties of $f_1$ and $f_2$ together with the analytic properties of
(small) symmetric square L-functions. We sketch the argument, since this may be
of interest in other contexts.

The basic point is the following lemma, which may be of independent interest: 
Lemma 4.1. Let $N \ge 1$ be an integer, $k \ge 2$ be an even integer and $f \in H_k^*(N)$ a
primitive cusp form of level $N$ and weight $k$ which is not of CM type. Then there
exist constants $\alpha > 0$ and $\delta > \frac{1}{2}$ such that

$$\sum_{|\lambda_f(p)| > \alpha} \frac{1}{p^{\sigma}} \ge \delta \sum_p \frac{1}{p^{\sigma}} + O(1)$$

for $\sigma > 1$. In fact, one can take $\alpha = 0.231$ and $\delta = \frac{1}{2} + \frac{1}{24}$.

Proof. It is convenient here to work with the Chebychev polynomials $U_n$ instead
of the Chebychev functions $X_n$ considered in the previous section: recall that for
$n \ge 0$, we have

$$X_n(\theta) = U_n(2\cos\theta)$$

where $U_n \in \mathbb{R}[x]$ is a polynomial of degree $n$. Then (3.6) gives $U_n(\lambda_f(p)) = \lambda_f(p^n)$
for any $f \in H_k^*(N)$, $p \nmid N$, and $n \ge 0$.
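The relation between $U_n$ and $X_n$ is easy to test numerically. The sketch below assumes the usual definition $X_n(\theta) = \sin((n+1)\theta)/\sin\theta$ (stated in an earlier section of the paper, not in this one) and builds $U_n$ from the recurrence $U_0 = 1$, $U_1 = x$, $U_{n+1} = xU_n - U_{n-1}$; the function names are ours.

```python
import math

def U(n, x):
    """Chebychev polynomial U_n in the normalization U_n(2 cos t) = sin((n+1)t)/sin(t)."""
    previous, current = 1.0, x
    if n == 0:
        return previous
    for _ in range(n - 1):
        previous, current = current, x * current - previous
    return current

def X(n, theta):
    """Chebychev function X_n (assumed definition, see the lead-in)."""
    return math.sin((n + 1) * theta) / math.sin(theta)

if __name__ == "__main__":
    for n in range(7):
        for theta in (0.3, 1.1, 2.5):
            print(n, theta, U(n, 2.0 * math.cos(theta)), X(n, theta))
```

The same recurrence is the Hecke relation $\lambda_f(p^{n+1}) = \lambda_f(p)\lambda_f(p^n) - \lambda_f(p^{n-1})$ for $p \nmid N$, which is why $U_n(\lambda_f(p)) = \lambda_f(p^n)$.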

We then claim that there exists a polynomial

$$Y = \beta_0 + \beta_2 U_2 + \beta_4 U_4 + \beta_6 U_6 \in \mathbb{R}[x]$$

with the following properties:

(i) $\beta_0 > \frac{1}{2}$;

(ii) for some $\alpha > 0$ and all $x \in [-2, 2]$, we have

(4.4) $$Y(x) \le \chi_A(x),$$

where $A := \{x \in [-2, 2] \mid |x| > \alpha\}$.

Assuming this, we conclude as follows: by (ii), we have

$$\sum_{\substack{p \nmid N \\ |\lambda_f(p)| > \alpha}} \frac{1}{p^{\sigma}} \ge \sum_{p \nmid N} \frac{Y(\lambda_f(p))}{p^{\sigma}} = \beta_0 \sum_{p \nmid N} \frac{1}{p^{\sigma}} + \sum_{i = 1}^{3} \beta_{2i} \sum_{p \nmid N} \frac{U_{2i}(\lambda_f(p))}{p^{\sigma}}.$$

By the holomorphy and non-vanishing at $s = 1$ of the second, fourth and sixth
symmetric power L-functions (see [10, Th. 3.3.7, Prop. 4.3] for the last two, noting
that non-CM forms are not dihedral, and [22] for a survey concerning those
L-functions), since $U_n(\lambda_f(p))$ is exactly the $p$-th coefficient of the $n$-th symmetric
power for $p \nmid N$, standard analytic arguments show that

$$\sum_{p \nmid N} \frac{U_{2i}(\lambda_f(p))}{p^{\sigma}} = O(1)$$

for $\sigma \ge 1$ and $i = 1, 2, 3$. Hence the result follows with $\delta = \beta_0 > \frac{1}{2}$.

Now to check the claim, and verify the values of $\alpha$ and $\delta$, we just exhibit a suitable
polynomial, namely

$$Y = \frac{1}{2} + \frac{1}{24} + \frac{1}{4}U_2 - \frac{1}{4}U_4 + \frac{136}{1000}U_6 = \frac{17}{125}x^6 - \frac{93}{100}x^4 + \frac{227}{125}x^2 - \frac{283}{3000},$$

since

(4.5) $$U_0 = 1, \quad U_1 = x, \quad U_2 = x^2 - 1, \quad U_4 = x^4 - 3x^2 + 1, \quad U_5 = x^5 - 4x^3 + 3x, \quad U_6 = x^6 - 5x^4 + 6x^2 - 1.$$

This polynomial is even, and its graph on $[-2, 2]$ is in Figure 1.
The value of $\alpha$ is an approximation (from below) to the real root

$$\alpha = 0.23107202470801418176315245050693402580\ldots$$

of $Y$ in $[0, 2]$; the maximum value of $Y$ on $[0, 2]$ is very close to 1. □

Figure 1. Graph of $Y$ on $[-2, 2]$.
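The claimed properties of $Y$ can be checked by a short computation. The sketch below takes $Y = \frac{1}{2} + \frac{1}{24} + \frac{1}{4}U_2 - \frac{1}{4}U_4 + \frac{136}{1000}U_6$ with $U_2 = x^2 - 1$, $U_4 = x^4 - 3x^2 + 1$, $U_6 = x^6 - 5x^4 + 6x^2 - 1$, expands it with exact rational arithmetic, locates the root in $[0, 1]$ by bisection, and samples the maximum on a grid. The helper names are ours, and the grid maximum is a numerical indication, not a proof.

```python
from fractions import Fraction as F

# Chebychev polynomials as {degree: coefficient} dictionaries.
U2 = {0: F(-1), 2: F(1)}
U4 = {0: F(1), 2: F(-3), 4: F(1)}
U6 = {0: F(-1), 2: F(6), 4: F(-5), 6: F(1)}

def combine(terms):
    """Exact linear combination of polynomials given as (coefficient, polynomial) pairs."""
    out = {}
    for coeff, poly in terms:
        for deg, c in poly.items():
            out[deg] = out.get(deg, F(0)) + coeff * c
    return out

beta0 = F(1, 2) + F(1, 24)
Y = combine([(beta0, {0: F(1)}), (F(1, 4), U2), (F(-1, 4), U4), (F(136, 1000), U6)])

def Y_at(x):
    """Floating-point evaluation of Y at x."""
    return sum(float(c) * x ** d for d, c in Y.items())

def bisect_root(lo, hi, iters=200):
    """Bisection for a sign change of Y on [lo, hi]."""
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if Y_at(lo) * Y_at(mid) <= 0:
            hi = mid
        else:
            lo = mid
    return (lo + hi) / 2.0

alpha = bisect_root(0.0, 1.0)
grid_max = max(Y_at(2.0 * j / 20000) for j in range(20001))

if __name__ == "__main__":
    print("coefficients:", dict(sorted(Y.items())))
    print("beta_0 =", beta0, " alpha ~", alpha, " max on grid ~", grid_max)
```

Exact fractions are used for the expansion so that the coefficients can be compared with the stated ones without rounding; only the root and the maximum are computed in floating point.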

The upshot of this lemma is that, given $f_1$ and $f_2$ (not of CM type), there exists
a set of primes of analytic density $> 0$ for which both $|\lambda_{f_1}(p)|$ and $|\lambda_{f_2}(p)|$ have a
positive lower bound. Then the sum of $\lambda_{f_1}(p)\lambda_{f_2}(p)$ over this set can not be small,
and this leads to an upper bound for the density of the "exceptional set". However,
the actual value from the above lemma is much smaller than what Theorem 4 uses
(it is about 1/1000).

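Another interesting point of this method is that using the sixth symmetric power
(and thus the deep results of Kim and Shahidi) is necessary for Lemma 4.1. For
this, note that the sequences $\{x_p\}_{p \text{ prime}}$ and $\{y_p\}_{p \text{ prime}}$ defined by $x_2 = y_2 = 0$ and
for primes $p \ge 3$ by

$$x_p = \begin{cases} 0 & \text{if } p \equiv 3 \pmod 4, \\ (-1)^{(p-1)/4}\sqrt{2} & \text{if } p \equiv 1 \pmod 4, \end{cases} \qquad y_p = \begin{cases} (-1)^{(p-3)/4}\sqrt{2} & \text{if } p \equiv 3 \pmod 4, \\ 0 & \text{if } p \equiv 1 \pmod 4, \end{cases}$$

have the "right" moments of order 1 to 5 for being Sato-Tate distributed,⁹ i.e., we
have

$$\sum_p \frac{U_k(x_p)}{p^{\sigma}} = O(1) \quad \text{and} \quad \sum_p \frac{U_k(y_p)}{p^{\sigma}} = O(1)$$

for $\sigma > 1$ and $1 \le k \le 5$, and yet $x_p y_p \ge 0$ for all $p$, in fact $x_p y_p = 0$, so that we
most certainly have

$$\sum_p \frac{x_p y_p}{p^{\sigma}} = O(1) \qquad (\sigma > 1).$$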

9 The sixth moment fails: it is 4 instead of 5 for the Sato-Tate distribution.

Remark 4. As a final remark, one can think of other ways (than looking at signs) of
reducing Fourier coefficients of modular forms to a fixed finite set: the most obvious,
at least if $f$ has integral coefficients $\lambda_f(n)n^{(k-1)/2}$, is to look at the coefficients modulo
some fixed prime number $\ell$. However, the situation there can be drastically different:
for instance, for all (infinitely many) elliptic curves with full rational 2-torsion, given
for instance by equations

$$y^2 = (x - a)(x - b)(x - c)$$

with $a$, $b$, $c$ distinct integers, the reduction modulo 2 of the odd prime coefficients
of the corresponding L-function (or modular form) is the same!
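This parity phenomenon can be observed by brute-force point counting. The sketch below is an illustration only: the three curves and the bound 60 are arbitrary choices of ours, and the function names are hypothetical. It computes $a_p = p + 1 - \#E(\mathbb{F}_p)$ for odd primes $p$ not dividing $(a-b)(b-c)(a-c)$ and checks that $a_p$ is even.

```python
def is_prime(n):
    """Trial-division primality test, adequate for small n."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

def count_points(a, b, c, p):
    """Points on y^2 = (x-a)(x-b)(x-c) over F_p, including the point at infinity."""
    square_counts = {}
    for y in range(p):
        r = y * y % p
        square_counts[r] = square_counts.get(r, 0) + 1
    total = 1  # point at infinity
    for x in range(p):
        rhs = (x - a) * (x - b) * (x - c) % p
        total += square_counts.get(rhs, 0)
    return total

def a_p(a, b, c, p):
    """Coefficient a_p = p + 1 - #E(F_p)."""
    return p + 1 - count_points(a, b, c, p)

def good_odd_primes(a, b, c, bound):
    """Odd primes up to bound at which the three roots stay distinct."""
    disc = (a - b) * (b - c) * (a - c)
    return [p for p in range(3, bound + 1) if is_prime(p) and disc % p != 0]

if __name__ == "__main__":
    for triple in [(0, 1, -1), (0, 1, 2), (1, 4, 9)]:
        parities = [a_p(*triple, p) % 2 for p in good_odd_primes(*triple, 60)]
        print(triple, parities)
```

The reason is elementary: at such a prime the three roots give three points with $y = 0$, the remaining affine points come in pairs $(x, \pm y)$, and together with the point at infinity the count is even, so $a_p = p + 1 - \#E(\mathbb{F}_p)$ is even for every odd $p$.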